Reply to Office Action of June 8, 2007

AMENDMENTS TO THE CLAIMS 

The following Listing of Claims, with an amendment to claim 16, will replace all prior
versions, and listings, of claims in the application. No new matter is introduced as a result
of the following claim amendments. Note that the amendment to claim 16 is provided to
correct an incorrect dependency noted by the Applicants.

1 (Original). A system for encoding an audio signal, comprising: 
analyzing sequential segments of at least one digital audio signal to determine
segment type as one of speech type segments, non-speech type segments, and unknown
type segments; 

encoding each speech segment as one or more signal frames using a speech 
segment-specific encoder; 

encoding each non-speech frame as one or more signal frames using a non-speech 
segment-specific encoder; 

buffering each sequential unknown type segment in a segment buffer until analysis 
of a subsequent segment identifies the subsequent segment type as any of a speech 
segment and a silence segment; and 

encoding the buffered segments and the subsequent segment as one or more 
signal frames using the segment-specific encoder corresponding to the type of the 
subsequent segment. 

2 (Original). The system of claim 1 wherein the non-speech type segments include 
silence segments and noise segments. 

3 (Original). The system of claim 1 further comprising transmitting the encoded 
buffered segments as a burst transmission at a rate higher than a current sampling rate of 
the audio signal. 

4 (Original). The system of claim 1 further comprising flushing the segment buffer 
following each time the buffered segments and the subsequent segment are encoded. 

5 (Original). The system of claim 1 wherein the sequential unknown type
segments in the segment buffer are encoded using a different frame size than a frame size 
used for encoding speech type segments and non-speech type segments. 

6 (Original). The system of claim 5 wherein the sequential unknown type segments 
in the segment buffer are all encoded in a single frame. 

7 (Original). The system of claim 5 wherein the sequential frames present in the 
buffer are all encoded in two frames, wherein a first frame is encoded as a speech type 
frame, and a second frame is encoded as a non-speech type frame. 

8 (Original). The system of claim 1 further comprising searching the sequential 
unknown type segments in the segment buffer to identify an actual onset point of speech 
corresponding to speech identified in the current segment. 

9 (Original). The system of claim 8 wherein the sequential frames present in the 
buffer are all encoded in two groups of frames, wherein a first group comprising all 
buffered segments preceding a segment in which the actual onset point was identified are 
encoded as non-speech segments, and a second group comprising the segment in which 
the actual onset point was identified and all subsequent buffered segments are encoded 
as speech segments. 

10 (Original). The system of claim 3 further comprising a decoder for receiving the
burst transmission, said decoder operating at a fixed frame rate. 

11 (Original). The system of claim 10 wherein the decoder uses extra samples
contained in the burst transmission to populate a jitter buffer. 

12 (Original). The system of claim 3 further comprising a decoder for receiving the
burst transmission, said decoder using an adaptive playout scheme. 

13 (Original). The system of claim 12 wherein the decoder uses extra samples
contained in the burst transmission to populate a jitter buffer. 

14 (Original). The system of claim 12 wherein the decoder compresses at least
some of the received data to reduce average signal delay. 

15 (Original). A system for encoding speech onset in a signal, comprising:
continuously analyzing and encoding sequential frames of at least one digital audio
signal while analysis of the sequential frames indicates that the sequential frames is of a
frame type including any of a speech type signal frame and a non-speech type signal 
frame; 

continuously analyzing and buffering sequential frames of the at least one digital 
audio signal while analysis of each sequential frame is unable to determine whether each 
sequential frame is of a frame type including any of the speech type signal frame and the 
non-speech type signal frame; 

automatically identifying at least one of the buffered sequential frames as having the 
same type as a current sequential frame when analysis of the current sequential frame 
indicates that it is of a frame type including any of the speech type signal frame and the 
non-speech type signal frame; and 

encoding the buffered sequential frames. 

16 (Currently Amended). The system of claim 15 further comprising temporally
compressing at least one of the buffered sequential frames prior to encoding those frames. 

17 (Original). The system of claim 16 further comprising searching the buffered 
sequential frames prior to temporally compressing those frames for identifying a speech 
onset point within one of the buffered sequential frames when the current sequential frame 
is a speech type signal frame. 

18 (Original). The system of claim 17 wherein buffered sequential frames preceding
the buffered sequential frame having the speech onset point are discarded prior to 
temporally compressing the buffered sequential frames. 

19 (Original). The system of claim 18 wherein initial samples in the frame having the
speech onset point which precede the speech onset point are discarded prior to temporally 
compressing the buffered sequential frames. 

20 (Original). The system of claim 19, wherein a frame boundary of the buffered 
sequential frame having the speech onset point is reset to coincide with the identified 
speech onset point. 

21 (Original). The system of claim 15 wherein the at least one digital audio signal
comprises a digital communications signal. 

22 (Original). The system of claim 15 further comprising flushing the buffer following 
encoding of the buffered sequential frames. 

23 (Original). The system of claim 15 wherein encoding any of the sequential 
frames and the buffered sequential frames comprises encoding those frames using a 
frame type-specific encoder corresponding to the type of each frame. 

24 (Original). A computer-implemented process for encoding at least one frame of a 
digital audio signal, comprising: 

encoding a current frame of the audio signal when it is determined that the current 
frame of the audio signal includes any of speech and non-speech; 

buffering the current frame of the audio signal in a frame buffer when it can not be
determined whether the current frame of the audio signal includes any of speech and
non-speech;

sequentially analyzing and buffering subsequent frames of the audio signal until
analysis of the subsequent frames identifies a frame including any of speech and
non-speech;

temporally compressing each buffered frame; and 

encoding the temporally compressed frames as one or more signal frames. 

25 (Original). The computer-implemented process of claim 24 further comprising 
searching the buffered subsequent frames in the frame buffer, prior to temporally 
compressing each buffered frame, for identifying a speech onset point within one of the 
buffered sequential frames when analysis of the subsequent frames identifies a frame 
including speech. 

26 (Original). The computer-implemented process of claim 25 wherein buffered 
sequential frames preceding the buffered frame having the speech onset point are 
identified as silence frames. 

27 (Original). The computer-implemented process of claim 26 wherein at least one
of the silence frames are discarded from the frame buffer prior to temporally compressing 
the buffered sequential frames. 

28 (Original). The computer-implemented process of claim 24 wherein temporally 
compressing each buffered frame comprises applying a pitch preserving temporal 
compression to the buffered frames. 

29 (Original). The computer-implemented process of claim 24 wherein temporally 
compressing each buffered frame comprises decimating at least one of the buffered 
frames. 

30 (Original). The computer-implemented process of claim 24 wherein the at least 
one digital audio signal comprises a digital communications signal. 

31 (Original). A method for capturing speech onset in a digital audio signal,
comprising: 

sequentially analyzing and encoding chronological frames of a digital audio signal 
when an analysis of the chronological frames identifies the presence of any of speech and 
non-speech in the frames of the digital audio signal; 

buffering all chronological frames of the digital audio signal when the analysis of the 
chronological frames is unable to identify a presence of any of speech and non-speech in 
the frames of the digital audio signal; 

identifying at least one of the buffered chronological frames as having a same 
content type as a current chronological frame of the digital audio signal when the analysis 
the current chronological frame identifies the presence of any of speech and non-speech in 
the digital signal following the buffering of any chronological frames; and 

encoding the current chronological frame and at least one of the buffered 
chronological frames. 

32 (Original). The method of claim 31 further comprising temporally compressing at 
least one of the buffered frames when the analysis of the chronological frames prior to 
encoding the current chronological frame and at least one of the buffered chronological 
frames. 

33 (Original). The method of 32 further comprising searching the buffered 
chronological frames in the frame buffer, prior to temporally compressing at least one of 
the buffered chronological frames, for identifying a speech onset point within one of the 
buffered chronological frames, and wherein said search is initialized using speech 
identified in the current chronological frame. 

34 (Original). The method of claim 33 wherein buffered chronological frames 
preceding the buffered chronological frame having the speech onset point are identified as 
non-speech frames. 

35 (Original). The method of claim 33 wherein samples of the at least one digital
audio signal within the buffered chronological frame having the speech onset point are 
identified as non-speech samples. 

36 (Original). The method of claim 31 wherein the at least one digital audio signal 
comprises a digital communications signal in a real-time communications device. 